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Complex networks possess a rich, multi-scale structure reflecting the dynamical and functional 
organization of the systems they model. Often there is a need to analyze multiple networks simul- 
taneously, to model a system by more than one type of interaction or to go beyond simple pairwise 
interactions, but currently there is a lack of theoretical and computational methods to address these 
problems. Here we introduce a framework for clustering and community detection in such systems 
using hypergraph representations. Our main result is a generalization of the Perron-Frobenius the- 
orem from which we derive spectral clustering algorithms for directed and undirected hypergraphs. 
We illustrate our approach with applications for local and global alignment of protein-protein inter- 
action networks between multiple species, for tripartite community detection in folksonomies, and 
for detecting clusters of overlapping regulatory pathways in directed networks. 



I. INTRODUCTION 

Complex networks in nature and society represent in- 
teractions between entities in inhomogeneous systems 
and understanding their structure and function has been 
the focus of much research. At the macroscopic scale, 
complex networks are characterized by, among others, a 
degree distribution, characteristic path length and clus- 
tering coefficient which are markedly different from those 
of regular lattices or uniformly distributed Erdos-Rcnyi 
random graphs [HE], while at the microscopic scale, they 
contain network motifs, small subgraphs occurring sig- 
nificantly more often than expected by chance The 
intermediate level usually exhibits the presence of com- 
munities or modules, sets of nodes with a significantly 
higher than expected density of links between them, typ- 
ical examples being friendship circles in social networks, 
websites devoted to similar topics in the World Wide Web 
or protein complexes in protein interaction networks (3J- 

However, the limitations of modeling a complex system 
by a network with a single type of pairwise interaction 
are becoming more and more clear. Folksonomies, online 
social communities where users apply tags to annotate 
resources such as images or scientific articles, have a tri- 
partite structure with three types of interactions [8] [9] . 
In biology, cellular systems are characterized by different 
types of networks which represent different physical inter- 
action mechanisms operating on different time-scales, in- 
tertwined with each other through extensive feedforward 
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and feedback loops (TU1 E] ■ To understand how evolu- 
tionary dynamics shapes molecular interaction networks, 
we need to compare them between multiple species with 
non-trivial many-to-many relations between their respec- 
tive node sets |12j . In order to move beyond simple net- 
works of pairwise interactions to model these and other 
systems, one suggestion has been to use hypergraphs, 
where edges are arbitrarily sized subsets of nodes. Al- 
though a number of studies have generalized various con- 
cepts from graph theory to hypergraphs 03 [T3HI6] , a 
rigorous mathematical foundation and general-purpose 
algorithm for clustering and community detection in hy- 
pergraphs is still lacking. 

Here we present a framework for spectral clustering 
in hypergraphs which is mathematically sound and al- 
gorithmically efficient. It is based on a generalization 
of the Perron-Frobenius theorem, which allows to define 
and compute a dominant eigenvector for hypergraphs and 
use its values for optimally partitioning the hypergraph's 
vertex set, similar to the operation of standard spectral 
clustering algorithms in ordinary graphs |17j . We demon- 
strate the validity of our approach through practical ap- 
plications in the analysis of real-world networks. In par- 
ticular we address the following problems. First, if two 
networks are defined on separate node sets with a many- 
to-many mapping between them (for instance protein- 
protein interaction networks in different species), it is a 
natural question to find matching communities in the two 
networks. This is the so-called network alignment prob- 
lem [12]. We show that this problem can be solved by 
finding clusters in a hypergraph where each hyperedge 
consists of two matching edges, one from each network 
(Section VIII A). Second, if multiple networks are de- 
fined on the same node set (i.e., together they form an 
edge-colored graph), there often exist functionally mean- 
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ingful, higher-order relations between the different edge 
types (for instance tripartite relations in folksonomies 
[SI [3] or network motifs in biological networks [TP1 fTT] ). 
Finding communities or modules which respect to these 
higher-order relations is what we call the network inte- 
gration problem. Here we show that any higher-order 
edge relation between different networks defines a sub- 
graph pattern in the corresponding edge-colored graph 
and that all instances of this pattern form a hypergraph. 
Hypergraph-based clustering can then be applied to iden- 
tify module s in such edge-colored graphs (Section VIII B 
andlVIIICl). 



II. GRAPHS AND HYPERGRAPHS 



Directed hypergraphs can be defined in many ways. 
For instance for fc-uniform hypergraphs, we can impose 
any form of permutation symmetry, or lack thereof, be- 
tween some or all of the k dimensions in each edge. In 
this paper, we will only consider the case where each edge 
E can be written as a pair (S, T), where S C V is called 
the 'source' vertex set and T C V the 'target' vertex set, 
with weight function w(S,T). Underlying a directed hy- 
pergraph, there is always an undirected hypergraph with 
edges E = S U T for every directed edge (S, T). As is the 
case for ordinary directed graphs, a stronger notion of 
connectivity is usually needed than simple connectivity 
of this undirected hypergraph. We defer the somewhat 
technical definition of strong connectivity of directed hy- 
pergraphs to Appendix |A") 



A graph Q is defined as a pair (V, £) of vertices V and 
edges (pairs of vertices) £ , which may be directed or not. 
In a weighted graph, a number is assigned to each edge 
which may represent, e.g., the cost, length or reliability 
of an edge. A hypergraph is a generalization of a graph 
where an edge, called hyperedge in this case, can connect 
any number of vertices, i.e., £ is & set of arbitrarily sized 
subsets of V. A particular class of hypergraphs are so- 
called /c-uniform hypergraphs where each hyperedge has 
the same cardinality k. Algebraically, a graph can be 
represented by an adjacency matrix A of dimension N x 
N , with N the number of vertices, such that A^ = 1 
if {i,j} G £ and otherwise. For undirected graphs, 
A is a symmetric matrix and for weighted graphs, A^ 
is defined to be the weight of the edge {i,j}- For k- 
uniform hypergraphs, the notion of adjacency matrix can 
be generalized to an adjacency multi- array or tensor T, 
with T il Ak = 1 if {ix, . . . ,if~} £ £ and otherwise. For 
a general hypergraph, we define a function w on the set 
of subsets of V such that w(E) = 1 for E € £ and 
otherwise. In general, we allow weighted hypergraphs 
where w can be any non-negative function. 

A path between two vertices i and j in a hypergraph is 
defined as a sequence of vertices i — i\, ii, . . . , ik+i = j 
and edges E\, . . . , E^ such that for all m, {i m , i m +i} C 
E m . A hypergraph is called connected if there exists a 
path between any pair of vertices. A stronger constraint 
on the structure of a hypergraph is that of irreducibil- 
ity. A hypergraph is said to be reducible if there exists a 
proper vertex subset I C V such that for any i G I and 
ii, ■■■,3m £ I, w({i,ji, ■ ■ ■ ,im}) = 0, and irreducible if 
it is not reducible. For ordinary graphs, connectedness 
and irreducibility are equivalent, but for hypergraphs this 
is not the case. An irreducible hypergraph is clearly con- 
nected, but the opposite is not always true. Indeed, if 
there exists a subset of vertices / such that paths cross- 
ing from i £ / to j £ I can always be chosen to do so 
through an edge of the form {i\, . . . ,i^, j\, ■ ■ ■ ,j m }, with 
k > 2, ii, . . . ,ik € / and ji, ■ ■ ■ ,j m & I, then we can 
set w({i,ji,. . . ,j m }) = for alii € I and ji, . . . ,j m I, 
thereby making the hypergraph reducible, without break- 
ing its connectivity. 



III. DOMINANT EIGENVECTORS AND 
SPECTRAL GRAPH CLUSTERING 

Although countless measures have been designed to de- 
fine clusters in a graph [5HZ], perhaps the simplest def- 
inition is that a cluster is a subset of vertices with a 
high number of edges between them, relative to its size. 
Mathematically, for a graph with adjacency matrix A, 
the edge-to-node ratio of a subset IcV can be written 
as 

S{x) ~ \X\ ' 

where \X\ denotes the number of elements in X. The 
number of subsets of a set with N elements grows expo- 
nentially in N and hence finding the subset with max- 
imal edge-to-node ratio by exhaustive enumeration is 
computationally infeasible for large graphs. However, 
if we denote by ux the unit vector in R N which has 
u x.i = \X\~ 1 / 2 for i 6 X and otherwise, we can write S 
as a scalar product and obtain the simple upper bound: 

(x Ax) 

S(X) = (u x ,Au x ) < max ' ,.» = A max , (1) 
xeR N ,x^o \\x\\ 2 

where (x, y) = ^ i Xijji is the standard inner product on 
R N , || a; || = y/ (x, x) is the length of x, and A max is the 
largest eigenvalue of A. By the Perron-Frobenius theo- 
rem [18j . if the graph is irreducible, the dominant eigen- 
vector x, which satisfies A max x = Ax, is unique, strictly 
positive (xi > for all i), and solves the variational prob- 
lem in the right-hand side of eq. ([!]). 

Hence, to find an approximate maximizer X of S, we 
can take the set X for which ux is as close as possible 
to the dominant eigenvector x, similar to what is done in 
other spectral clustering algorithms based on the Lapla- 
cian or modularity matrices |17j . i.e., define 

X = argmax(u x , x) = argmax — : , V" x { . 
xcv xcv |Ad v frt 
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Since x > 0, X is of the form X c = {i : Xi > c] for some 
threshold value c. Instead of X, we therefore choose the 
solution of the restricted variational problem 



Xry, a X 



argmax«S(X c ). 

oo 



(2) 



as an approximate maximizer. Solving eq. |2]) is linear in 
the number of vertices, since we only need to consider the 
values c equal to the entries of x. Moreover, S(X max ) > 
S(X), and hence X max is a better approximation to the 
true maximizer of S than X. 

Thus we obtain a numerically highly efficient spectral 
graph clustering algorithm: 

1. Calculate the dominant eigenvector x using for in- 
stance a power method [19] . 

2. Find the cluster X max which solves the restricted 
variational problem in eq. ([2]). 

3. Store A max , remove all edges between nodes in 
X max from the edge set £, and repeat the proce- 
dure until no more edges remain. 

This result of this algorithm is a partition of the edges 
of the input graph. Edge clustering algorithms have re- 
cently gained popularity as they allow for overlapping 
communities where nodes may belong to more than one 
community [501 IS] • 

This procedure generalizes immediately to directed or 
bipartite graphs. In this case a cluster consists of a 
'source' set X and 'target' set Y with edge-to-node ratio 



S{X,Y) 



exjeY Aij 



The dominant eigenvector is replaced by the dominant 
left and right singular vectors x and y corresponding to 
the largest singular value of A, which are again unique 
and strictly positive [IB]. A max and Y max are found by 
maximizing S(X, Y) over sets obtained by thresholding 
on the entries of x and y. 



IV. PERRON-FROBENIUS THEOREM FOR 
HYPERGRAPHS 

Our aim is to generalize the previous graph spectral 
clustering algorithm to arbitrary hypergraphs. For this 
purpose we first need a generalization of the Perron- 
Frobenius theorem. Let % = (V, £) be an undirected 
hypergraph on N vertices. Define for x E R N and p > 1 



fc P (x) 



E* 

EES 



i£E 



(3) 



where w(E) is the non- negative weight of edge E and 
||x|| p = (J2i \ x i\ p ) 1 ^ p is the p-norm of x. We have the 
following key result: 



Theorem 1. 1Z P attains its maximum on the set of unit 
vectors S p = {u E R N : \\u\\ p — 1}. If H is connected, 
there is a unique maximizer x E which is strictly 
positive and satisfies the Euler- Lagrange equations 



\ p x- 



p _ 



E 



w(E)fyr 



LEI Ul~ J 

{E££:i£E} 1 1 j&E 



(4) 



subject to the constraint \\x\\ p = 1 and with X p — lZ p (x). 
By analogy with the matrix case, we call x the dominant 
eigenvector ofH. 

For clarity, we first prove this theorem in the simpler 
case when % is irreducible. The proof of the general case 
is given in Appendix [B] 

Proof. Existence of a maximizer on follows from 
Weierstrass's theorem [TB]. Clearly, since 7Z p (x) = 
lZ p (\x\), we can always choose a maximizer x to have 
non-negative entries. Hence we can find i as a station- 
ary point of the Lagrangian 



C(x) 



EES 



i£E 



a (in: 
p 



i 



giving rise (for non-negative x) to the Euler-Lagrange 
equations 



Xxf- 1 = 



E 



w(E) ( 



E \ V -i J- ~ J 

{E££:i£E} 1 1 j&E,jjti 



( n 



(5) 



Let / = {i e V: x.- L — 0} and i E I. Assume there 
exists an edge E = ,j m } with .,j m I. 

Then the left-hand side of cq. ([5| is while the right- 
hand side is oo. Hence such an edge cannot exist, but 
this contradicts the assumption of irreducibility of %. It 
follows that / = or x > 0. Multiplying both sides of 
eq. (pi by Xi we obtain eq. Q. Summing both sides in 
eq. (Hi over i gives A p = 7Z p (x) — max x i lZ p (x'). 

Next assume y > is another maximizer of 1Z P . Denote 
c = mxDi(xi/yi), u = cy, and z = x — u > 0. Since 
\\ x \\p = ll^llp = 1: we have c < 1 and c p < c for p > 1. 
Denote / = {i E V : Zi = 0}. For any i € /, by the 
Euler-Lagrange equations, 



= X p {4-<fyf) 



> E *>(m[(U*j 



{Ee£: i£E} 



jGE 



n 

~j£E 



Since each term in the last sum is non-negative, they 
must all be zero. Hence for any ji,...,jk & I, if 
{hJi, ■ ■ -,3k} € £ then 



0=11**. n 

m— 1 rn — 1 

k m— 1 

=E(n%»)( 



.)( n 



m— 1 n— 1 



*3mJ\ 11 ^Jt7 
n— m+1 



(6) 
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Again each term in this sum is non-negative and must 
therefore be zero, but this contradicts ji,---,jk ^ I- 
Hence edges with i G I and ji j^. G" I do not ex- 
ist, but this contradicts the assumption of irreducibility. 
Since / 0, we must have / = V or x = y. □ 

Next consider directed hypergraphs with hyperedges 
E = (S,T), S,T C V as defined before. Then define 
1Z P! q(x, y) for x, y G R and p, q > 1 



K Ptq (x,y) = 



E < 

(s,T)e£ 



'(5,r)n 



2| H 



By identical arguments as for undirected hypergraphs, 
it can be shown that for a strongly connected directed 
hypergraph, there exists a unique pair x G and y G 
such that 1Z M {x,y) > TZ Piq (x',y') for all x',y' G 
R N . These maximizers are strictly positive and satisfy 
the Euler-Lagrange equations 



A x p 



E 

{(s,T)es-. ies} 



E 

{(S,T)G£: jeT} 



2\S\ Vll^ 



(II ^ 



w(S,T) 
2\T\ 



(n«r(n 



Vi' 



j'ET 



(8) 



(9) 



subject to the constraints 



1 and with 



\ Pt q = 1Z P}q (x, y). Details are given in Appendix [B| 



V. SPECTRAL CLUSTERING AND 
BICLUSTERING IN HYPERGRAPHS 

Having a generalization of the Perron- Frobenius theo- 
rem, it is straightforward to also generalize the spectral 
clustering method. Define for IcV, 



S P {X) 



J2ecx w ( e ) 



Tl p {u x ) < K p (x), (10) 



with x the dominant eigenvector and ux G now de- 
fined by ux.i — \X\~ 1 / p for i G X and otherwise. The 
parameter p balances cluster size versus edge density. For 
p = 1, S p is the ratio of edges to nodes in X. Taking p > 1 
diminishes the influence of the denominator and progres- 
sively favors to have a high number of edges rather than 
a high number of edges per node in high-scoring clusters 
(further details in Section VII I. The spectral clustering 
algorithm becomes: 

1. Calculate the maximizer x of lZ p . 

2. Find the cluster A max which solves the restricted 
variational problem 

^max = argmaxS p (X c ) 
oo 



with X c = {i G V : x. L > c}. 

3. Store X max , remove all hyperedges between nodes 
in X max from the edge set £, and repeat the proce- 
dure until no more hyperedges remain. 

The maximizer can be calculated using a generaliza- 
tion of the power method for matrices fTH] or tensors 
|22) : starting with an initial vector x^ and defining 



A 



(0) _ 



llx^Hp = 1, we compute from x^ using 

the Euler-Lagrange equations Q in the following steps: 



„(«+!) 



A 



E 

{Ee£ : ieE} 

£ ( " +1) Hp 

r («+i) 

j j, 

(n+l) ' 
P 



,{E)( 



\E\ 



(n 



(11) 

(12) 
(13) 



iterated until the components of x^ become station- 



ary or, equivalently, \ p 
eigenvalue, i.e., 



(n) 



has converged to the dominant 



>(«+!) 



A 



< e, 



(14) 



where e is a predefined numerical tolerance threshold. 
Due to the uniqueness of x, the choice of starting vector 
is not important. By taking a non-negative one, such as 
the uniform vector x^ = [1, 1, ... , 1] T /N 1 ^, we ensure 
that the powers of 1/\E\ occurring in the Euler-Lagrange 
equations arc always defined unambiguously. Many of 
the hypergraphs occurring in real- world applications are 
not connected. In such cases it is important to ensure 
that x^ ' has support only on a single connected compo- 
nent to obtain the unique maximizer for that component. 

Although we typically view a cluster as a subset of 
vertices, it is actually a subset of hyperedges (all hyper- 
edges E C X max ) and thus can be considered as a sub- 
hypergraph as well. Higher-scoring clusters can thus be 
obtained by recursively applying the previous procedure 
to each of the clusters itself until no more subdivision 
which improves the score is found. 

For directed hypergraphs, we have a biclustering 
method. Define for X, Y C V and p, q > 1 



Sp,q(X, Y) 



E 



SCX,TcY 



w(S,T) 



\X\^\Y\^ 



Approximate maximizers A max and y max are found by 
solving the restricted variational principle 



y 

max i 1 max 



) = argmax5 )9 , g (A Cl ,y c2 ), 

(ci,c 2 ) 



with X Cl = {i E V : Xi > ci} and Y C2 = {i G V : yi > c 2 }, 
where x and y are the unique solutions of the Euler- 
Lagrange equations ([8|-(|9|, which can again be calcu- 
lated using a power algorithm. 
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VI. RELATION TO PREVIOUS WORK 



VII. ALGORITHM VALIDATION 



The matrix algorithm for clustering in a simple graph 
has its roots in a method for image pattern recognition 
[2"3~] and using the singular value decomposition to detect 
densely linked sets in directed networks goes back to the 
work of Kleinberg [21] . The novelty here lies in the defi- 
nition of a discrete cluster through solving the restricted 
variational problem, instead of using an ad-hoc cut-off on 
the eigenvector entries. For fc-uniform hypergraphs, we 
can define rescaled variables y$ = x\^ k such that maxi- 
mizing lZ p (x) becomes equivalent to maximizing 



K'(y) 



y. 



T 



■Vi k 



blip' 



with p' = kp. In this case, Theorem [T] reduces to a 
multi-linear extension of the Perron-Frobenius theorem 
to non-negative irreducible tensors of arbitrary dimen- 
sion, which has been the subject of several recent papers 
[2"5] - f2"T] (which all depend on the strong irreducibility con- 
dition) . The proof given in Theorem [I] is considerably 
simpler, holds for general connected hypergraphs and fol- 
lows more closely the proof of the matrix theorem [18] . In 
the unsealed variational problem for lZ' p , , the maximizer 
is unique for p' > k and thus it is unsuited for gener- 
alizing to arbitrary hypergraphs where the uniqueness 
condition would become p' > fc max , the maximum edge 
size in the hypergraph. This explains why we introduced 
the geometric average over the values Xj in eq. (§. To 
the best of our knowledge, Theorem [T] is the first proof 
of a Perron-Frobenius theorem for general hypergraphs. 

For k = 3, we have previously used a similar approach 
to find clusters of 3-node network motifs in integrated in- 
teraction networks [28, 215]. In this case an adjacency ten- 
sor T rst is defined to be 1 if an instance of a 3-node query 
motif or graph pattern exists between vertices (r, s, t) and 
otherwise. More generally, we can define for any fc-node 
query pattern a fc-uniform hypergraph consisting of all 
instances of the query pattern in a given graph Q. Our 
algorithm will identify clusters of vertices in Q with a 
high number of pattern instances between them, which 
often have a functional meaning in biological networks 

EH [30]. 



Another example for k — 3 concerns the analysis and 
clustering of multiply linked data [3T] [35] or multislice 
networks [33]. Here we are given a set of M directed 
or undirected graphs and define a hypergraph adjacency 
tensor as Ty m = , where A^> denotes the adjacency 
matrix of the m th graph. Clustering in this case identifies 
vertex sets which are densely connected in multiple, but 
not necessarily all, graphs. 



A. Random geometric graphs 

The dominant eigenvector of a graph's adjacency ma- 
trix is often considered as a centrality measure ('eigenvec- 
tor centrality' pQ) and is, in essence, equal to a simplified 
PageRank 34J for ranking global vertex importance. It 
may thus come as a surprise to see it playing a role in 
identifying localized clusters (however, see the references 
in the previous section). In order to demonstrate the 
validity of our approach and illustrate the statements in 
Section[V] we applied it to randomly generated geometric 
graphs of various sizes (see Appendix C 1 for details) . 

For visualization purposes, we generated as a toy ex- 
ample a random geometric graph with 100 vertices and 
radius r 2 = 0.02 (Fig. [l}^). The graph is evidently mod- 
ular and the six highest-scoring edge clusters identified 
by our algorithm (with p — I) are indicated in color. The 
profiles of the corresponding dominant eigenvectors are 
clearly localized on a subset of nodes (Fig. |Tj3) , illustrat- 
ing that in a modular network, the dominant eigenvector 
indeed indicates the location of a single cluster. Fur- 
thermore, comparing the edge-to-node ratio for each of 
the discovered edge clusters with the theoretical upper 
bound in eq. ([!]) shows that the solution of the restricted 
variational problem (eq. (|2|) must be close to the true 
maximum (Fig. [l]C). 

For a more systematic analysis we performed triangle- 
based clustering on sequences of geometric graphs 
with constant expected edge density and varying size. 
Triangle-based clustering searches for overlapping sets of 
triangles in an ordinary graph and corresponds to the 
simplest form of fc-clique clustering [35j . Here we consid- 
ered each instance of a triangle in the input graph as a 
hyperedge in a 3- uniform hypergraph to which we applied 
our spectral clustering algorithm. The parameter p can 
be used to identify clusters at different levels of resolu- 
tion. Independent of network size, there is a \ow-p phase 
where the fraction of nodes in a cluster is small compared 
to total network size, and a high-p phase where a clus- 
ter consists of a macroscopic network portion (Fig. [Tj3). 
Interestingly, at p = 1 cluster size does not depend on 
network size (Fig. |TJZ) , insert). Hence clustering based 
on (hyper)edge-to-node ratio scores (eq. (10)) does not 



suffer from a resolution limit problem where cluster size 
grows with network size irrespective of the presence of 
'natural' clusters at smaller scales [35] EZ]- As in the 
previous example, the cluster scores are always close to 
their theoretical upper bounds, demonstrating that the 
solution of the restricted variational problem is close to 
the true optimum in all cases [3H1 Fig. SI]. 



B. Edge-to- node scaling parameter 

The transition in Fig. [Tp as a function of the edge-to- 
node scaling parameter p is a general feature, indepen- 



G 



A. Randomly generated geometric graph 




B. Dominant eigenvector profiles 




30 40 
Vertex ID 



C. Cluster scores and upper bounds 



D. Relative cluster size vs. resolution parameter p 




0.9 
0.8 
0.7 
0.6 
© 0.5 
0.4 
0.3 
0.2 
0.1 
0' 













A N=200 






N=400 






— it — N=600 






O N=800 






— B— N=1000 

























































10 15 
Cluster ID 



1 



FIG. 1. (Color online) A. Example of a randomly generated geometric graph with 100 vertices and radius r 2 = 0.02, showing 
the largest connected component with the six highest-scoring edge clusters indicated by filled nodes. B. Dominant eigenvector 
profiles for the six highest-scoring edge clusters. C. Edge-to-node ratio scores (left blue bars) and theoretical upper bound (right 
red bars) for all 25 edge clusters. D. Cluster size as the fraction $ of total number of network nodes for the highest-scoring 
triangle-based cluster in random geometric graphs with N = 200, 400, 600, 800 and 1000 nodes and constant edge density 
(p = 4) as a function of p. Each data point is an average over 10 random networks. The insert shows the absolute mean cluster 
size and standard deviation over 10 random networks as a function of N for p = 1. 



dent of the actual hypergraphs used, and can be easily 
understood as follows. Assume we have a hypergraph 
n = (V,£) with N = \V\ nodes and M = \£\ hyperedges. 
Then the relative score of any set X C V with n = \X\ 
nodes and m hyperedges compared to the score of the 
total hypergraph is 

S P {X) m {N\* a 2 _ 

with ccj and a 2 de fractions of nodes and edges in X. The 
phase diagram of s p as a function of these two variables 



is independent of the actual hypergraph under consider- 
ation (Fig. [2]). Naturally, not all combinations of a\ and 
«2 are admissible. In general, there exists a boundary 
a 2 < f( a i) with f(pti) ~ ati for at w 1. In sparse hy- 
pergraphs, we typically have M ~ N 1+s with S small, 
often S — 0. Locally however, the edge density can be 
much higher. For instance in ordinary edge clustering 
m ~ n 2 and in triangle-based clustering m ~ n 3 , for n 
not too large. Hence as ot\ decreases from 1, the bound- 
ary function f{a\) will deviate more and more from the 
diagonal a 2 = a\. In Fig. [2j we have sketched a typical 
shape of a boundary function (thick line) . At p = 1 (Fig. 
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[2j top left), the contour lines of s p are straight lines and 
s p will clearly be maximal at small values of (a i, 02). As 
p increases, the contour lines become increasingly more 
concave, pushing the value where s p attains its maximum 
towards ot\ = 1. For the idealized boundary function in 
Fig. § the transition is in fact discontinuous and jumps 
from being at oi\ — 0.1 (origin of the axes) to ol\ = 1 
around p = 1.95 [bottom left]). 

Since the transition is in general sharp as a function of 
p and can even be discontinuous, we will in practice only 
use the default edge-to-node ratio score with p = 1 to 
identify dense hypergraph clusters, or use a large value 
of p (typically p > 10) to identify connected hypergraph 
components. 



C. Algorithm efficiency 

For an undirected hypergraph with N nodes, M hy- 
peredges and maximum edge size fc max , the update steps 
in the power algorithm are at most of the order fc m axAf 
[eq. (Ill] and N [eq. ( 12 ) and ( 13 )]. The number of steps 



needed to reach convergence depends on the convergence 
parameter e [eq. ( fl4| )] and therefore possibly also on the 
hypergraph size. In practice, a maximal number of itera- 
tions I max is defined and convergence manually inspected 
when / max is exceeded. Determining the optimal thresh- 
old value is at most of the order TV (number of possible 
threshold values) times M (calculation of the edge-to- 
node ratio score). Taken together, runtime is bounded 
by 

[O(fcmaxM) + O(N)} + O(MN). 

For directed hypergraphs, determining the optimal 
threshold pair over all possible combinations of entries 
of the dominant singular vector pair (x, y) is of the or- 
der N 2 M , which is often prohibitive. In such instances, 
taking 

A^nax — 

argmax 7l p . q (u Xc , y) 

c 

i^max = argmax TZ M {x,u Yc ), 

c 

where we used the same notation as in Section [VJ results 
in an approximation which is again O(MN). 



VIII. APPLICATIONS 

A. Local and global alignment of complex networks 

The core idea for applying hypergraph clustering to 
the analysis of edge-colored graphs is to translate the re- 
lation between multiple interaction types (edge colors) 
into higher-order hypergraph edges. We illustrate this 
idea by showing that local and global alignment of com- 
plex networks with a bipartite many-to-many mapping 



between their vertex sets can be naturally viewed as a 
hypergraph clustering problem. 

Network alignment is the problem of finding topolog- 
ically similar regions between two or more networks. In 
local network alignment, small subgraphs in each network 
are aligned independent of the alignment of other sub- 
graphs, whereas global network alignment aims to find 
a maximal alignment for each connected component in 
the input graphs. Network alignment methods for com- 
paring molecular interaction networks between different 
species come in two main flavors. Topological network 
alignment finds conserved regions between networks tak- 
ing only the topology of each network into account |39j . 
The second class of methods takes into account that net- 
works in different species have evolved from a common 
ancestor through gene duplication and divergence mech- 
anisms and hence that there exists a meaningful mapping 
between the nodes in each network [12]. Methods have 
been developed which assume a one-to-one mapping |40) , 
but in general a many-to-many map should be considered 

More formally, consider two ordinary graphs Q\ and Q2, 
whose vertices are connected by a bipartite graph M. 
The directed alignment hypergraph % between Q\ and 
Q2 is defined as the 4-uniform hypergraph containing the 
edges {{i,j},{k,l}) if and only if {i,j} € Q x , {k,l} G Q 2 
and {i,k},{j, 1} 6 M (Fig. [3]A_). Such alignment hy- 
peredges are also called interologs. Interolog mapping is 
routinely used to transfer annotation information from 
one organism to another [32] and interolog analysis is at 
the heart of previous network alignment methods [4"Tll43| . 
Here we propose to address the network alignment prob- 
lem by identifying hyperedge clusters in the alignment or 
interolog hypergraph. Indeed, in a local alignment, we 
search for small regions in each graph which map nearly 
perfectly onto each other, i.e., have a high density of in- 
terologs between them. This corresponds to hypergraph 
clusters which maximize S p for values of p close to one. 
In a global alignment we search for maximally matching 
regions in each graph, i.e., connected components in the 
interolog hypergraph. These correspond to hypergraph 
clusters which maximize S p for large values of p. 

We used our spectral clustering algorithm to locally 
and globally align protein-protein interaction networks 
between yeast and human, using orthology groups for 
mapping conserved proteins between both organisms (see 
Appendix C2 for details). Protein-protein interaction 
networks represent binary, undirected associations be- 
tween proteins and they are, at present, the most ex- 
tensively characterized molecular interaction networks in 
biology [TT] 01]. Typical examples of high-scoring lo- 
cal alignment clusters are conserved protein complexes 
[551 Table S2]. Fig. [3^3 shows two examples: first a 
set of proteins which map one-to-one between yeast and 
human from the MCM complex (cluster no. 19), which 
plays an important role in DNA replication and is indeed 
conserved among all eukaryotes [35]; the second exam- 
ple (cluster no. 1) is a set of components of the V-type 



p=1.00 p=1.65 




FIG. 2. (Color online) Phase diagrams of s v {a\,a,2) for p = 1, 1.65, 1.95 and 5 (left to right, top to bottom). More yellow 
(lighter gray) indicates higher values of s p ; the thin lines are contours of constant s p , while the thick line indicates a possible 
boundary of admissible states. Colors (gray scale levels) are relative to the minimum and maximum in each panel and not 
comparable between panels. 



ATPase (a proton pump) which has expanded in human 
compared to yeast by gene duplications [46] . Other local 
alignment clusters reflect more general functional net- 
works than protein complexes [351 Table S3]. Fig. [3p 
shows cluster no. 48, an example of a conserved net- 
work involved in nucleic acid metabolism centered around 
the general transcription factor TBP (SPT15 in yeast), 
the TATA-binding protein. The largest connected com- 
ponent in the network alignment hypergraph maps 651 



yeast proteins to 766 human proteins and contains 90% 
of all interologs J38J Table S4] , showing that there exists 
a high degree of network conservation at a global scale, 
consistent with previous findings using topological net- 
work alignment |39j . 
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A. Network alignment hyperedge 

(mT) — — — (phT) 

- — <!b> 



C. Example of a local yeast-human functional network alignment 





FIG. 3. A. A (directed) hyperedge in the yeast-human protein interaction network alignment hypergraph is a so-called interolog: 
a pair of interacting yeast (Y) proteins and a pair of interacting human (H) proteins connected by orthology relations (dashed 
lines). B. Examples of aligned protein complexes (cluster no. 19 left, no. 1 right). C. Example of a functional network 
alignment (cluster no. 48). In all panels, yeast proteins are white and human proteins are grey; protein interactions are solid 
and orthology relations are dashed. 



Tripartite community detection in online 
folksonomies 



Folksonomies, online communities where users collab- 
oratively create and annotate data, are examples of so- 
cial systems that cannot be adequately modeled by ordi- 
nary graphs. For instance, tagged social networks such 
as Flickr [J7J or CiteULike [J5] have a tripartite structure 
that is best modeled by a 3-uniform hypergraph [8, 9J. 
Using CiteULike as a concrete example, each hyperedge 
consists of a user who has annotated an academic arti- 
cle with a certain keyword or tag [35] (Fig. [4jV). Tradi- 
tionally, the community structure of such tripartite net- 
works has been analysed by considering one-mode ordi- 
nary graph projections of the hypergraph, e.g. by con- 
necting two users if they have annotated the same articles 
or connecting two tags if they have been applied to the 
same articles [9]. In contrast, hypergraph-based clus- 
tering preserves the tripartite structure of folksonomy 
data and reveals additional levels of community struc- 
ture. We applied our spectral clustering algorithm to a 
subset of the CiteULike data set containing more than 
400,000 (user, article, tag) entries and identified nearly 
14,000 hyperedge clusters (see Appendix C3 for details). 



The additional level of detail present in hyperedge clus- 
ters is illustrated by looking at the user, article or tag 
overlap between clusters. Fig. [4^3 shows an example of 
two hyperedge clusters formed by the same set of users 
who have annotated different sets of articles by different 
sets of tags. Only one tag, 'pattern recognition', is com- 
mon between both clusters. The remaining tags show 
that the articles in the first cluster are about collective 
computing and swarm intelligence, whereas those in the 
second cluster deal with image analysis [35J Table SI], 
which are indeed two distinct subjects within the broad 
field of pattern recognition. 

In general, we expect such sub-divisions of one-mode 
projected communities to occur at the level of users (i.e. 
the same set of users annotating different sets of arti- 
cles using different sets of tags), but much less at the 
level of articles or tags (i.e. we do not expect different 
sets of users to annotate the same set of articles using 
different sets of tags, or to use the same set of tags for 
different sets of articles). Indeed, the 100 highest-scoring 
clusters (which together contain about 20% of all hyper- 
edges) overlap predominantly at the user level, to a much 
lesser extent at the tag level, and hardly at the article 
level, while about 21 of these clusters do not have any 
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FIG. 4. A. CiteULike hyperedge which represents one instance of a user (hexagonal node) who has annotated an article 
(circular node) with a certain tag (rectangular node). B. Example of two tripartite communities where the same set of users 
(top) has annotated two sets of articles (middle) with two sets of tags (bottom). Only the two central articles and one central 
tag ('pattern recognition') overlap between the two clusters. User-tag edges have been omitted for clarity. C. Coarse-grained 
view of the CiteULike hypergraph using the 100 highest-scoring hyperedge clusters. Each node represents a cluster (node size 
proportional to number of hyperedges in the cluster) and edges represent significant overlap between clusters (overlap score 
> 0.5, edge size proportional to overlap score). Solid edges, user overlap; dashed edges, tag overlap; wavy edges, article overlap. 
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significant overlap (overlap > 50%, see Appendix C 3 for 



details) with any other cluster (Fig. |4p). Significant ar- 
ticle overlap occurs in only two instances. In both cases, 
it concerns a subset of users who have annotated a sub- 
set of articles from a larger cluster with an additional 
set of tags. Tag overlap occurs more frequently than 
article overlap, but with lower overlap percentages than 
user overlaps. Overlapping tags are typically general tags 
which can be applied to a broad spectrum of articles. 
For instance, the ten tags occurring most frequently in 
the top 100 clusters are: bibtex-import, learning, social, 
evolution, review, support, govt, non-us, collaboration, 
design. Thus we conclude that hyperedge clusters cap- 
ture topic-specific tripartite (user, article, tag) commu- 
nities which reveal more structure of the underlying data 
than user, article or tag communities based on a single 
data-dimension only. 



C. Path clustering in regulatory networks 

Unlike protein-protein interaction networks, which are 
undirected, regulatory networks, which control the cellu- 
lar response to external or internal perturbations, are di- 
rected and represent the flow of information within a cell 
[TO] . In transcriptional regulatory networks, the response 
to perturbations can be measured experimentally by ge- 
netically knocking-out a transcription factor (TF) and 
measuring the resulting changes in gene expression lev- 
els on a genome- wide scale [49]. In yeast, direct physical 
binding interactions between a TF and its target genes 
[50] as well as perturbational response data for the same 
TF [49] are available for a comprehensive set of almost 
200 TFs (see Appendix C4 for details). On average only 
3% of the genes which respond to a knock-out perturba- 
tion of a TF are also direct physical targets of that TF 
and various approaches have been proposed to under- 
stand the mechanisms of indirect regulation and propa- 
gation of network perturbations in this context |51H54j . 
It is thought that perturbational responses are organized 
in a modular way, in the sense that groups of genes will 
be affected by the knock-out of a TF through the same 
intermediate regulatory pathways. However, due to the 
variable length of these pathways, previous approaches 
for clustering in directed networks (e.g., [55™57j), which 
identify densely interacting node sets, are not directly 
applicable to this problem. 

Here we address the problem of identifying sets of 
nodes which respond to the knock-out of a TF through 
similar regulatory paths by defining a non-uniform hy- 
pergraph where each hyperedge corresponds to a short- 
est path between two nodes in the original regulatory 
network. Hypergraph-based clustering will then find sets 
of nodes with a high number of shortest paths running 
through them and such clusters form potential 'signal- 
propagation' modules, consistent with the notion that 
high information flow in a network is associated to high 
values of a node's betweenness centrality (defined as the 



number of shortest paths between all pairs of nodes pass- 
ing through a given node). To test this hypothesis, we 
calculated all directed shortest paths in the regulatory 
network of yeast between a TF and the genes differen- 
tially expressed upon knock-out of that TF. The result- 
ing hypergraph contained 1332 hyperedges between 788 
nodes and spectral clustering identified 25 non-singleton 
and 14 singleton clusters (see Appendix C4 for details). 



Topologically, there appear to exist two distinct types of 
path clusters. Combinatorial path clusters contain genes 
responding to the knock-out of multiple TFs and form a 
network of densely overlapping paths. Fig. [5]^ shows a 
combinatorial cluster of 199 shortest paths from 20 TFs 
to 186 genes involved in glycolysis and gluconeogenesis. 
Hierarchical path clusters have a layered structure, where 
the perturbational signal of usually not more than one 
TF flows to its targets via a limited number of intermedi- 
ate TFs, in a strictly hierarchical manner (Fig. [5^3). The 
functional relevance of regulatory path clusters is demon- 
strated by the fact that they contain a significant fraction 
of the genes affected by the deletion of the cluster's TF 
and that they strongly overlap with specific functional 
categories [35} Table S5 and S6]. For simplicity, we con- 
sidered here only shortest paths in the transcriptional 
regulatory network, but clearly the approach can be ex- 
tended to paths composed of multiple interaction types. 



IX. CONCLUSIONS 

Over the past decade, graph theory has become crucial 
to represent and reason about complex network data. In 
particular clustering, the detection of densely intercon- 
nected groups of vertices with few connections to the rest 
of the network, has become a standard coarse-graining 
procedure to understand the structure and function of 
complex networks. With more and more data becoming 
available to highlight different aspects of the same com- 
plex systems, a need has arisen to analyze networks with 
multiple types of interactions simultaneously. In this pa- 
per, we have proposed to use hypergraphs to characterize 
higher-order relations between simple graphs and we have 
introduced efficient algorithms for clustering and biclus- 
tering in such hypergraphs. 

Our main result is a spectral clustering algorithm for 
hypergraphs, based on a generalization of the Perron- 
Frobenius theorem for directed and undirected hyper- 
graphs. More precisely, we have shown that, like in ordi- 
nary graphs, there exists a unique, positive vector, called 
the dominant eigenvector, over the set of vertices of a 
hypergraph, which maximizes a natural generalization of 
the Rayleigh-Ritz ratio for matrices. The importance of 
this result lies in the fact that the ratio of the number of 
edges to the number of nodes in any subset of vertices can 
be expressed as the same Rayleigh-Ritz ratio, in graphs 
and hypergraphs alike. Densely interconnected clusters 
can therefore be found very efficiently by first comput- 
ing the dominant eigenvector and then converting it to 
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A. Combinatorial path cluster 




B. Hierarchical path cluster 




FIG. 5. (Color online) Examples of a high-scoring combinatorial (A, Cluster no. 6) and hierarchical (B, Cluster no. 1) path 
clusters in the yeast transcriptional regulatory network. Red (dark gray) rectangular nodes, knocked-out transcription factors 
(TFs); yellow (light gray) circular nodes, genes differentially expressed upon knock-out of the TFs; white diamond-shaped 
nodes, all other TFs. Node size, resp. edge width, is proportional to out-degree, resp. edge betweenness (defined for the 
purposes of this figure as the number of shortest paths between all pairs of cluster nodes passing through a given edge) . 
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a discrete set of vertices. Uniqueness of the dominant 
eigenvector guarantees unambiguity of the solution and 
rapid convergence of the numerical procedure, whereas 
positivity implies that the discretization can be achieved 
by setting an optimal threshold on its entries. 

Our work has been motivated by concrete problems 
of data integration in social and biological networks. We 
have given three practical examples for using hypergraph- 
based clustering in these contexts, namely the alignment 
of protein-protein interaction networks between multiple 
species using interolog clustering, the detection of tripar- 
tite communities in folksonomies and the identification 
of overlapping regulatory pathways in perturbational ex- 
pression data using shortest path clustering. Undoubt- 
edly, many more applications for hypergraph-based clus- 
tering exist in the analysis of other biological, social, com- 
puter, communication or neural networks. From a theo- 
retical point of view, we have considered the edge-to-node 
ratio as a simple quality score for clusters in graphs and 
hypergraphs. Although this score has many attractive 
properties, such as its direct relation with the dominant 
eigenvector and the absence of any resolution limit prob- 
lems, it will still be of interest to generalize clustering 
algorithms based on other quality scores from graphs to 
hypergraphs as well. Popular methods like those based 
on minimal cutsets or modularity maximization also rely 
on spectral properties of, respectively, the graph Lapla- 
cian and modularity matrix. Although certain mathe- 
matical aspects, such as eigenvalue multiplicity and its 
implications on algorithm convergence and cluster dis- 
cretization, are more complicated in these cases, we be- 
lieve our work lays the theoretical foundations for future 
studies in this direction. 



Appendix A: Strong connectivity of directed 
hypergraphs 

Consider first an undirected hypergraph H = (V, £ ) on 
N vertices. Although connectedness of H does not imply 
irreducibility, we do have the property that if there exists 
a proper subset / cV such that for all i\, . . . , ik € I and 
ii, ■ ■ ■ ,3m & I, w({h, ■ ■ ■ ,ik,ji, ■ ■ ■ ,jm}) = 0, then H 
is not connected (since there can then be no path that 
starts in / and escapes from /). Hence if H. is connected, 
no such set I exists. 

For a directed hypergraph % = (V, £ ) we can define an 
underlying undirected hypergraph % = (V,£) by consid- 
ering all possible partitions of a subset E C V into source 
and target sets, i.e. w(E) = J2{(st)- sut=e} w (S, T). 
This procedure generalizes the definition of a symmetric 
adjacency matrix B = A + A T from the asymmetric ad- 
jacency matrix A of a directed graph. Clearly, to call H 
connected, we shall ask that % is connected as defined in 
Section HU 

Now consider two subsets 7, J C V such that I U J 
is neither empty nor equal to V. Since % is connected, 
there exists vertices ix,...,ik € I, ji,---,je € J and 



hi, ... , h m I U J such that 

w({h, . ■ . ,ik,3i, ■ ■ ■ ,3t,hx, . . ■ ,h m }) > 0. 

This implies that there exists at least one partition of 
these nodes into a source and target set with non-zero 
directed weight. We ask slightly more, namely that there 
is a partition of the form 

w({h, . . . , ik, hi, . . . , h n }, {ji, . . . ,j t , h n+ i, . . . , hm}) > 0, 

i.e., the source as well as the target set should contain at 
least one element not in I or J. Note that the require- 
ment that all i's go into the source set and all j's into 
the target set is purely notational convenience, since I 
or J are allowed to be empty, as long as their union is 
not. If the above condition is fulfilled for all pairs of sets 
(I, J) , we say that the directed hypergraph % is strongly 
connected. 



Appendix B: General proof of the Perron- Frobenius 
theorem for connected hypergraphs 

Consider a non-negative maximizer x of lZ p (x) and 
without loss of generality assume ||^|| p = 1. Let again 
I = {i € V: xi = 0} and assume 7^0. Let k be the 
smallest integer for which there exists at least one set 
ii, . . . ,ik € I and at least one set ji,...,j m & I such 
that . . . ,ik,ji, ■ ■ ■ ,3m}) > 0. Such k must exists, 

since % is connected (see Appendix [A]). For e > 0, define 

\x l i ^ I 

x i = \ 

[e iei 

We will show that for e small enough, lZ p (x) > lZ p (x), 
contradicting the assumption that there can exist a max- 
imizer with zero elements. We have 

\m p P = H£ + \r\e* = 1 + \I\e*, 

or, to leading order in e, 



For the denominator of lZ p (x), we have 

E^)(n^)* 

Ee£ ieE 

{EES: Eni=<D} iEE 

+ e -Kn^r 

{EES-. Eni^H} ieE 

=n P (x)+ e *>vE)(n £ *) m - ( B2 ) 

{EES: i&E 
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From the preceding discussion, it follows that the leading 

term in e of the second term in eq. (B2| is of the order 

fc — 
e k+ m f or some k,m> 1. Hence, for e small enough, the 

k i ■ 

extra positive term of order e k +™ in eq. (B2| offsets the 



negative term of order e p in eq. (Bl), and we get, for 
some c > 0, 

ttp(£) = (1 - f<? + o(eP)){n p (x) + ce^ + o(e^)) 
= K p {x) + ce^ + o(e^ ) 



> 



TZ p (x). 



Having established that a maximizer x must be posi- 
tive, x > 0, the remainder of the proof is the same as the 
proof for irreducible hypergraphs, since in eq. ([6]), it suf- 
fices that at least one j m ^ I to arrive at a contradiction, 
which is guaranteed by the connectedness of H. 

For directed hypergraphs, the condition (and defini- 
tion) of strong connectivity in Appendix [A| is tailor made 
to ensure that the above argument still goes through. 
More precisely if (x, y) are a pair of non-negative maxi- 
mizes of TZ Pt q(x, y) (cf. eq. |7|), define / = {i e V: Xi = 
0} and J = {j 6 V: yj = 0}. Setting the zero-elements 
in x and y to a small positive value e, strong connectivity 
implies that the numerator of TZ p , q increases by a term 
of order e a with a < 1, whereas the denominator (the 
norms of x and y) can only decrease TZ p . q by a term of 
order e E i R withp, q > 1. The uniqueness argument again 
follows along the lines leading to eq. □ 



Appendix C: Network data and numerical settings 



Here we summarize the data sources and parameter 



settings used in the example applications (Section VII 
andlYn!]). 



1. Random geometric graphs 

A geometric graph with N vertices and radius r is de- 
fined by a set V of points in a metric space and edges 
£ = {(u, v) G V: < u|| < r). We generated ran- 
dom geometric graphs by sampling with uniform proba- 
bility N points in the unit square [0, 1] x [0, 1] and tak- 
ing the standard 2-norm as the distance measure. For a 
given vertex, the probability that it is connected to any 
other vertex is irr 2 . Hence if we increase N while keep- 
ing p = Nr 2 constant we obtain a sequence of random 
geometric graphs with constant average expected degree. 



2. Alignment of yeast and human PPI networks 

We obtained physical protein-protein interactions 
(PPI) for yeast from the BioGPJD [58] database and 
physical and functional PPIs for human from the Bi- 
oGRID and STRING [55] databases. The yeast network 



had 36,391 interactions between 4,847 proteins; the hu- 
man network 40,630 interactions between 9,602 proteins. 
We integrated these networks with orthology mappings 
from the InParanoid database [50]. There were 3,390 
orthology relations between 2,245 yeast and 3,255 hu- 
man proteins which had at least one interaction in their 
respective PPI networks. We performed recursive spec- 
tral clustering on the directed alignment hypergraph con- 
sisting of 2,567 interolog-hyperedges (cf. Fig. [3]^). At 
p = q = 1, 180 clusters with at least two hyperedges 
were found; 119 hyperedges had no connections in the 
hypergraph, forming singleton clusters. The complete 
distribution of hyperedges, nodes and scores for all clus- 
ters is shown in [38j Fig S2]; the functional analysis of 
the local and global alignment clusters is given in [3"51 
Table S2 and S3]. 



3. Tripartite community detection in the 
CiteULike data 

We obtained the complete 'who-posted-what' data 
from CiteULike [48] (jhttp : / /www . citeulike . org/f aq/ 
|data. adp[ ), containing (as of Feb. 1st, 2012) 16,553,642 
(user, article, tag) entries. To create a more manageable 
data set, we considered all entries from 2005, resulting in 
a hypergraph of 466,948 (user, article, tag) hyperedges 
between 4,693 users, 121,071 articles and 36,489 tags. 
Recursive hypergraph spectral clustering with p = 1 
identified 13,987 clusters with at least two hyperedges; 
4,616 hyperedges formed singleton clusters. The com- 
plete distribution of hyperedges, nodes and scores for all 
clusters is shown in [38] Fig S3]. While comparing the 
user, article and tag overlap of two hyperedge clusters, 
we were primarily interested to detect when the set of 
users, articles or tags of a smaller cluster is entirely con- 
tained in a larger cluster (cf. Fig. H). We therefore used 
the overlap score defined for two sets X and Y as 



ovi P (A,r) = 



\x n Y\ 



rin(|X|,|Y|) 



which reaches its maximum value of 1 whenever X C Y 
orFcX 



4. Path clustering in the yeast transcriptional 
regulatory network 

We obtained a network of 11,373 physical transcrip- 
tion factor (TF) binding interactions between 198 TFs 
and 3,535 target genes in yeast from [SO] and knock-out 
microarray data for 266 TFs from [49]. The knock-out 
data can be represented as a directed network of per- 
turbational interactions where each TF is connected to 
the genes which respond to the knock-out perturbation 
of that TF. 182 TFs with physical binding data also had 
knock-out data for a total of 7,090 perturbational interac- 
tions. We constructed a directed hypergraph consisting 
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of 1,332 hyperedges and 788 nodes, where each hyper- 
edge is a shortest path in the regulatory network between 
a TF and a gene differentially expressed upon knock-out 
of that TF. We defined the source set of a hyperedge as 
the knocked-out TF and the target set as the remainder 
of the path. Recursive spectral clustering identified 39 
clusters of which 14 were singletons. 



5. Supplementary data and algorithm 
implementation 

An implementation of the clustering algorithm in Java, 
together with the input data and clustering results de- 
scribed in Section |VHI| is available from the project 
homepage at http : / / schype . googlecode . com 
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